AD-A118  757  MASSACHUSETTS  INST  OF  TECH  CAMBRIDGE  LAB  FOR  INFORMA— ETC  F/6  12/1 

CONVERGENCE  AND  ASYMPTOTIC  AGREEMENT  IN  DISTRIBUTED  DECISION  PR— ETC (U) 
JUL  82  J  N  TSITSIKLIS#  M  ATHANS  N00014-77-C-0532 

UNCLASSIFIED  LIOS-P-1229  NL 


FILE  COPY  AD  A118757 


tnc  i  \ i j 


WCUWTY  CLASSIFICATION  OF  THIS  PAGE  fP»i«n  Dm  Eaim*« 


£ 


REPORT  DOCUMENTATION  PAGE 


READ  INSTRUCTIONS 
BEFORE  COMPLETING  FORM 


r  report  number 


[eTOOVT  accession  no. 

#7)-  T5~7 


S.  RECIPIENT'S  CATALOO  NUMBER 


4.  TITLE  (and  Su Ainu; 

CONVERGENCE  AND  ASYMPTOTIC  AGREEMENT  IN 
DISTRIBUTED  DECISION  PROBLEMS 


S.  TYPE  OF  REPORT  *  PERIOD  COVEREO 

Paper 


S.  PERFORMING  ORG.  REPORT  NUMBER 

LIDS-P-1229 


I.  contract  or  a  rant  number;*; 


7.  AUTHOR;*; 


John  N.  Tsitsiklis 

Michael  Athans 

ONR/N00014- 77-C-0532 

».  PERFORMING  ORGANIZATION  NAME  ANO  ADORE II 

M.I.T. 

Laboratory  for  Information  and  Decision  Systems 
Cambridge,  MA  02139 

10.  PROGRAM  ELEMENT.  PROJECT.  TASK 
AREA  4  WORK  UNIT  NUMBERS 

II.  CONTROLLING  OFFICE  NAME  ANO  AOORESS 

Office  of  Naval  Research 

12.  REPORT  DATE 

July,  1982 

Arlington,  VA  22217 

IS.  NUMBER  OF  PAGES 

29 

14.  MONITORING  AGENCY  NAME  4  AOORESSf/f  dlllatanl  h am  Controlling  Olllea ) 

IS.  SECURITY  CLASS,  (ml  111  la  taport) 

Unclassified 

I  16.  DISTRIBUTION  STATEMENT  (ot  thim  Rmpoel) 

Approved  for  public  release:  distribution  unlimited 


This  paper  will  appear  in  the  Proceedings  of  the  1982  IEEE  Conference  on 
Decision  and  Control;  it  has  also  been  submitted  to  the  IEEE  Transactions  on 
Automatic  Control. 

It.  KEY  WORDS  (Cantlnu*  an  ravataa  ml  da  II  nacaaaary  and  Idantltr  N  Flee*  runnier; 


m 


_k- 


20.  nmSTHACT  (Canllnua  an  ravataa  alda  II  naeaaaarr  and  Idanlllr  ky  klmek  mmikat)  Consider  a  distributed 

Iteam  decision  problem  in  which  different  agents  obtain  from  the  environment  dif¬ 
ferent  stochastic  measurements,  possibly  at  different  random  times,  related  to  the 
same  uncertain  random  vector.  Each  agent  has  the  same  objective  function  and  prior] 
reliability  distribution.  We  assume  that  each  agent  can  compute  an  optimal  tenta¬ 
tive  decision  based  upon  his  own  observation. and  that  these  tentative  decisions  are] 
communicated  and  received,  possibly  at  random  times,  by 'a  subset  of  other  agents. 
Conditions  for  asymptotic  convergence  of  each  agent's  decision  sequence  and 
symptotic  agreement  of  all  agents'  decisions  are  derived.— '  _  _ 


4 


July  1982 


LIDS-P  -1229 
(Revision  of  LIDS-  P-1185) 


Wc  consider  a  distributed  team  decision  problem  in  which  different  agents  obtain  from  the  environ¬ 
ment  different  stochastic  measurements,  possibly  at  different  random  times,  related  to  the  same 
uncertain  random  vector.  Each  agent  has  the  same  objective  function  and  prior  probability  distribu¬ 
tion.  Wc  assume  that  each  agent  can  compute  an  optimal  tentative  decision  based  upon  his  own 
observation  and  that  these  tentative  decisions  arc  communicated  and  received,  possibly  at  random 
times,  by  a  subset  of  other  agents.  Conditions  for  asymptotic  convergence  of  each  agent’s  decision 
sequence  and  asymptotic  agreement  of  all  agents’  decisions  are  derived. 


*  Research  was  supported  by  the  Office  of  Naval  Research  under  contract  ONR/N00014-77-C-0532 
(Nil  041-519) 

**Room  35-406,  Laboratory  for  Information  and  Decision  Systems,  Massachusetts  Institute  of 
Technology,  Cambridge,  Mass.,  02139. 

This  paper  will  appear  in  the  Proceedings  of  the  1982  JEEE  Conference  on  Decision  and  Control;  it 
has  also  been  submitted  to  the  IEEE  Transactions  on  Automatic  Control. 


on 


Consider  the  following  situation:  A  set  {1, . . N}  of  N  agents  possessing  a  common  model  of  the 
world  (same  prior  probabilities)  and  having  the  same  cost  function  (common  objective)  want  to  make 
an  optimal  decision.  Each  agent  bases  his  decision  on  a  set  of  observations  he  has  obtained  and  we 
allow  these  observations  to  be  different  for  each  agent.  Given  this  setting,  the  decisions  of  the  agents 
will  be  generally  different.  Aumann  (4]  has  shown,  however,  that  agreement  is  guaranteed  in  the 
following  particular  case:  If  the  decision  to  be  made  is  the  evaluation  of  the  posterior  probability  of 
some  event  and  if  all  agents’  posteriors  are  common  knowledge,  then  all  agents  agree.  (In  Aumann's 
terminology,  common  knowledge  of  an  event  means  that  all  agents  know  it,  all  agents  know  that  all 
agents  know  it,  and  so  on,  ad  infinitum.  ) 

The  situation  where  each  agent's  posterior  is  common  knowledge  is  very  unlikely,  in  general. 
On  the  other  hand,  if  agreement  is  to  be  guaranteed,  posteriors  have  to  be  common  knowledge.  The 
problem  then  becomes  how  to  reach  a  state  of  agreement  where  decisions  are  common  knowledge, 
starting  from  an  initial  state  of  disagreement. 

Geanakoplos  and  Polemarchakis  [7]  and  Borkar  and  Varaiya  [6]  gave  the  following  natural 
solution  to  the  above  problem:  Namely,  agents  start  communicating  to  each  other  their  tentative 
posteriors  (or,  in  the  formulation  of  [6]  the  conditional  expectation  of  a  fixed  random  variable)  and 
then  update  their  own  posterior,  taking  into  account  the  new  information  they  have  received.  In 
the  limit,  each  person’s  posterior  converges  (by  the  martingale  convergence  theorem)  and  assuming 
that  “enough”  communications  have  taken  place,  they  all  have  to  converge  to  a  common  limit. 

The  above  results  hold  even  when  each  agent  obtains  additional  raw  observations  during  the 
adjustment  process  and  when  the  history  of  communications  is  itself  random.  Similar  results  were 
also  proved  for  a  detection  problem  [6]. 

A  related  -  and  much  more  general  situation  -  is  the  subject  of  this  paper;  we  assume  that 
the  agents  are  not  just  interested  in  obtaining  an  optimal  estimate  or  a  likelihood  ratio,  but  their 
objective  is  to  try  to  minimize  some  common  cost  function,  given  the  available  information.  (Clearly, 
if  each  agent  has  a  different  cost  function  no  agreement  is  possible  even  if  each  agent  had  identical 
information.)  In  this  setting,  we  assume  that  agents  communicate  to  each  other  tentative  decisions 
(which  initially  will  be  different).  That  is,  at  any  time,  an  agent  computes  an  optimal  decision  given 
the  information  he  possesses  and  communicates  it  to  other  agents.  Whenever  an  agent  receives  such 
a  message  from  another  agent,  his  information  essentially  increases  and  he  will,  in  general,  update  his 
own  tentative  decision,  and  so  on.  In  the  sequel  we  prove  that  the  qualitative  results  obtained  in  (6], 


[7]  for  the  estimation  problem  (convergence  and  asymptotic  agreement)  are  also  valid  for  the  decision 
making  problem  for  several,  quite  general,  choices  of  the  structure  of  the  cost  function.  However, 
tentative  decisions  do  not  form  a  martingale  sequence  and  a  substantially  different  mathematical 
approach  is  required  for  the  proofs.  We  point  out  that  estimation  problems  are  a  special  case  of  the 
decision  problems  studied  in  this  paper,  being  equivalent  to  the  minimization  or  the  mean  square 
error. 

A  drawback  of  the  above  setting  is  that  each  agent  is  assumed  to  have  an  infinite  memory.  We 
have  implicitly  assumed  that  the  knowledge  of  an  agent  can  only  increase  with  time  and,  therefore, 
he  has  to  remember  the  entire  sequence  of  messages  he  has  received  in  u.e  past.  There  is  also 
the  implicit  assumption  that  if  an  agent  receives  additional  raw  data  from  the  environment,  while 
the  communication  process  is  going  on,  these  data  are  remembered  forever.  These  assumptions  are 
undesirable,  especially  if  the  agents  are  supposed  to  model  humans,  because  limited  memory  is  a 
fundamental  component  of  the  bounded  rationality  behavior  of  human  decision  makers  (14).  We  will 
therefore  relax  the  infinite  memory  assumption  and  allow  the  agents  to  forget  any  portion  of  their 
past  knowledge.  We  only  constrain  them  to  remember  their  most  recent  decision  and  the  most  recent 
message  (tentative  decision)  coming  from  another  agent.  (For  a  particular  class  of  communication 
protocols,  we  even  allow  them  to  forget  their  most  recent  decision.)  We  then  obtain  convergence 
results  similar  to  those  obtained  for  the  unbounded  memory  model,  although  in  a  slightly  weaker 
sense. 

A  parti*  ul.u  pioblein  of  interest  is  one  in  which  all  random  variables  are  jointly  Gaussian  and  the 
cost  is  a  quadratic  function  of  an  unknown  state  of  the  world  and  the  decision.  It  was  demonstrated 
in  [6]  that  the  common  limit  to  which  decisions  converge  (for  the  estimation  problem)  is  actually 
the  centralized  estimate,  i.e.  the  estimate  that  would  be  obtained  if  all  agents  were  to  communicate 
their  detailed  observations.  We  prove  (section  4)  that  the  same  is  true  in  the  presence  of  memory 
limitations,  provided  that  each  agent  never  forgets  his  own  raw  observations.  (That  is,  he  may  only 
forget  past  tentative  decisions  sent  to  him  by  other  agents.)  We  indicate  that  for  linear  quadratic 
Gaussian  (LQG)  problems  our  scheme  is  essentially  a  decomposition  algorithm  for  solving  static  linear 
estimation  problems.  As  wc  point  out  in  section  4,  this  scheme  has  certain  appealing  features:  there 
is  significant  parallelism  in  the  computations  which  matches  nicely  with  the  assumed  distribution  of 
the  data;  also,  in  the  course  of  the  algorithm,  acceptable  estimates  are  obtained  much  earlier  than 
the  time  that  would  be  needed  to  compute  the  optimal  estimate  by  centralizing  the  information. 
These  tentative  estimates  can  be  very  useful  whenever  there  are  strict  time  limits  within  which 
certain  decisions  have  to  be  made. 
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We  also  consider  (section  5)  a  slightly  different  scheme  in  which  each  agent  transmits  his  tentative 
decision  to  a  coordinator.  The  latter  evaluates  a  weighted  average  of  the  tentative  decisions  he  has 
received  and  sends  it  back  to  all  agents.  We  show  that  our  results  remain  valid  for  this  scheme  as 
well  and  suggest  an  economic  interpretation  in  which  the  coordinator  can  be  viewed  as  some  sort 
of  market  mechanism.  We  also  show  that  making  optimal  tentative  decisions  corresponds  to  Nash 
strategies  for  a  certain  sequential  game. 

A  weak  point  of  the  model  is  that  not  only  each  agent  has  the  same  prior  information  and 
Knows  the  statistics  of  the  other  agents’  observations  but  also  has  the  same  model  of  the  probabil¬ 
istic  mechanism  that  generates  inter-agent  communications.  In  particular,  if  this  is  a  deterministic 
mechanism,  an  agent  must  know  the  precise  history  of  communications  between  any  pair  of  other 
agents,  a  strong  requirement.  If  it  is  a  stochastic  mechanism,  then  there  are  two  possibilities:  either 
the  history  of  communications  becomes  commonly  known  on-line  (at  the  expense  of  additional 
communications)  or  each  agent  will  have  to  make  probabilistic  inferences  about  the  communications 
between  all  other  agents.  These  weaknesses  disappear,  however,  if  every  tentative  decision  is  broad¬ 
cast  simultaneously  to  all  other  agents,  at  each  stage.  In  that  case  the  history  of  communications 
is  simple,  commonly  known  and  easy  to  remember.  (This  will  be  the  case,  for  example,  if  a  set  of 
experts  with  the  same  objective  tclcconfer  and  take  turns  into  suggesting  what  they  believe  to  be 
the  optimal  decision.) 

Finally,  we  point  out  the  ways  in  which  our  scheme  is  different  from  other  schemes  for  dis¬ 
tributed  decision  making  or  computation:  In  team  decision  theory  {9]  each  agent  tries  to  behave 
optimally,  while  trying  to  anticipate  the  behavior  of  the  other  agents;  the  issue  of  who  implements 
what  component  of  the  decision  vector  is  very  important;  we  are  interested  instead  in  consensus  and 
in  a  common  decision,  quite  independently  of  implementation  issues.  In  many  schemes  for  distributed 
computation  [2,5]  each  agent  specializes  in  updating  only  those  components  of  the  decision  vector 
that  have  been  assigned  to  him,  whereas  in  our  scheme  each  agent  updates  the  entire  decision  vector. 

Motivation. 

There  are  many  real  world  situations  in  which  several  agents  (or  processors)  with  different  on¬ 
line  information  have  to  cooperate,  combine  their  information  and  arrive  at  a  common  decision. 
Examples  can  be  drawn  from  power,  air  traffic  control  and  command  and  control  systems.  What 
distinguishes  such  situations  is  that: 

(i)  There  are  often  rigid  time  limits  within  which  preliminary  or  final  decisions  must  be  made. 
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(ii)  There  are  often  communications  limitations,  restricting  the  number  and  the  nature  of  the  messages 
that  can  be  exchanged.  This  is  particularly  true  in  tactical  command  and  control  systems  [3]. 

(iii)  Conflict  resolution  procedures  involving  higher  levels  of  the  decision  making  hierarchy,  are 
undesirable  because  they  are  likely  to  result  in  delays  and  tend  to  overload  these  higher  levels. 

The  need  arises  for  a  scheme  that  leads  to  a  final  common  decision  (consensus)  while  taking  into 
consideration  the  above  limitations.  While  our  scheme  guarantees  exact  agreement  after,  possibly,  an 
infinite  number  of  exchanges  of  messages,  in  most  situations  approximate  agreement  is  acceptable. 
Our  scheme  then,  is  appropriate  whenever  empirical  or  theoretical  considerations  indicate  that  the 
average  number  of  communications  required  for  approximate  agreement  satisfies  the  time  constraints 
imposed  by  the  actual  situation. 

A  scheme  that  would  involve  the  centralization  of  all  data  (by  communicating  them  to  a  predeter* 
mined  agent)  is  usually  undesirable  for  the  following  fundamental  reasons:  It  is  often  the  case  that  too 
many  data  are  available,  which  would  overload  communications  channels;  moreover,  good  decisions 
can  often  be  made  on  the  basis  of  aggregations  of  the  initial  data;  furthermore,  if  an  agent  is  a 
model  of  a  human  the  plethora  of  data  would  saturate  his  short  term  memory.  This  implies  that  it 
is  preferable  to  communicate  aggregate  data.  Determining  an  optimal  way  to  “aggregate"  is  not  a 
well-DOScd  problem.  If  constraints  n-c  placed  on  the  number  of  bits  to  be  transmitted,  the  problem 
becomes  computationally  intract  :.de,  even  in  very  simple  situations,  where  there  is  only  a  finite 
number  of  possible  events  and  decisions  [11].  On  the  other  hand,  if  a  message  is  allowed  to  be  any  real 
number,  all  data  can  be  coded  in  a  single  message.  Also,  for  any  fixed  aggregation  protocol,  an  agent 
could  slightly  change  his  message  and  code  all  information  in  the  least  significant  bits  of  his  message. 
(This  is  reminiscent  of  decentralized  control  problems  in  which  an  agent  may  observe  the  decisions 
of  other  agents  -  the  so-called  control  sharing  pattern  [1,13].)  Any  such  trick  is  very  sensitive  to 
noise  in  the  channel  and  is  effectively  just  a  more  complicated  way  of  centralizing  information.  Since 
centralization  was  deemed  undesirable  in  the  first  place,  any  indirect  way  of  centralizing  should  be 
also  undesirable  and  explicitly  prohibited. 

The  above  discussion  implies  that  a  particular  aggregation  of  the  data  should  be  chosen  by 
means  of  some  ad-hoc  rule  that  guarantees  that  certain  desirable  characteristics  are  present.  Unless 
some  particular  structure  on  the  problem  is  assumed,  the  optimal  tentative  decision  given  an  agent’s 
information  seems  to  be  a  very  natural  message  that  an  agent  could  transmit  and  this  is  the  reason 
that  we  have  adopted  such  a  framework  in  this  paper. 
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In  this  section  we  present  a  mathematical  formulation  of  the  model  informally  described  in 
the  introduction.  We  start  with  the  general  assumptions  and  later  proceed  to  the  development  of 
alternative  specialized  models  to  be  considered  (e.g.  memory  limitations,  particular  forms  of  the  cost 
function  etc.).  As  far  as  the  description  of  the  sequence  of  communications  and  updates  goes,  we 
basically  adopt  the  model  of  Borkar  and  Varaiya  [6]  except  that  time  is  considered  to  be  discrete.  As 
in  [6],  events  are  timed  with  respect  to  a  common,  absolute  clock.  As  far  as  notation  is  concerned, 
we  will  use  subscripts  to  denote  time  and  superscripts  to  denote  agents. 

W"  assume  that  we  are  given  a  set  {1 . N}  of  N  agents,  an  underlying  probability  space 

(0,9, 9)  and  a  real  valued  cost  function  c:fi  X  U  >-*  R,  where  U  is  the  set  of  admissible  values  of 
the  decision  variable.  It  will  be  useful  in  the  sequel  to  distinguish  between  elements  of  U  and  U- 
valued  random  variables.  The  letter  v  will  be  used  to  denote  elements  of  U  whereas  u,  w  will  be 
used  to  denote  {/-valued  random  variables  (measurable  functions  from  ft  to  U). 

Assumption  1:  Either 

(1.1) :  U  is  a  finite  set,  or 

(1.2) :  U  —  Rn,  for  some  n. 

Assumption  2:  The  cost  function  c  is  nonnegative  and  jointly  measurable  in  (w,u).  Moreover, 
E\c(v)\  <  oo,  Vw  S  U.  When  assumption  (1.2)  holds,  we  assume  that  there  exists  a  positive  and 
measurable  function  4:ft  *-»  R  such  that 

~  *>2l|2<^[c(w,  «i)  +  c(w,  o2)l  —  c^w,  Vweft,  Vvi.^ei/.  (1) 

(Remark:  If  we  fix  uj,  pj,  vij&vi  and  take  expectations  of  both  sides  of  (1),  it  follows  that  A  is 
integrable.) 

Inequality  1  implies  that  c  is  a  strictly  convex  function  of  v  and  strict  convexity  holds  in  a 
uniform  way,  for  any  fixed  uj  6  ft.  It  also  follows  that  c(u>,  v)  is  continuous  for  any  w  (=  ft.  This 
assumption  is  satisfied,  in  particular,  if  c  is  twice  continuously  difTercntiable  in  v  and  its  Hessian  is 
positive  definite,  uniformly  in  v,  for  any  fixed  u>  €  ft. 

We  may  use  the  function  A,  defined  in  assumption  2,  to  define  a  new  measure  n  on  (ft,*?)  by 

n(B)~  [  A[u)d*f[w),  B£  9.  (2) 

Jo 


This  measure  will  be  used  in  section  3. 


We  now  consider  the  generic  situation  facing  agent  i  at  some  time  n.  Let  9),  C  ?  be  a  0-field 
of  events  describing  the  information  possessed  by  agent  i  at  time  n.  Because  of  assumption  2,  the 
conditional  expectation  £(c(v)|9fj  exists  (is  finite),  is  ^-measurable  and  is  uniquely  determined  up 
to  a  set  of  measure  zero,  for  any  fixed  v  £  U.  Agent  t  then  computes  a  tentative  decision  u{,  that 
minimizes  £[c(v)|5*J.  The  following  Lemma  (proved  in  the  Appendix)  stages  that  u{,  is  well-defined 
and  ^-measurable. 

Lemma  1:  Under  assumptions  1.2,  2,  there  exists  a  ^-measurable  random  variable  which  is 
unique  up  to  a  set  of  measure  zero,  such  that 

£,(c( u |«r^J <£7(c(tw) |«F ,  almost  surely,  (3) 

for  any  [/-valued,  ^-measurable  random  variable  w.  The  same  results  are  true,  (except  for  unique¬ 
ness)  under  assumptions  1.1,  2. 

VVe  continue  with  a  description  of  the  process  of  communications  between  agents.  When,  at 
time  n,  agent  t  computes  his  tentative  optimal  decision  u|t,  he  may  communicate  it  to  any  other 
agent.  (If  uj,  is  not  unique,  a  particular  minimizing  ti|,  is  selected  according  to  some  commonly 
known  rule.)  Whether,  when  and  to  which  agents  u'n  is  to  be  sent  is  a  random  event  whose  statistics 
arc  described  by  (fi,  7,  ?).  In  particular,  it  may  depend  or.  the  data  possessed  by  agent  i  at  time  n. 
So,  we  implicitly  allow  the  agents  to  influence  the  process  of  communications,  although  we  do  not 
require  this  influence  to  be  optimal  in  any  sense.  This  allows  the  possibility  of  signalling  additional 
information,  beyond  that  contained  in  uj,,  by  appropriately  choosing  when  and  to  which  agents  to 
communicate.  We  allow  the  communication  delays  to  be  random  but  finite.  We  also  assume  that 
when  an  agent  receives  a  message  he  knows  the  identity  of  the  agent  who  sent  it. 

We  now  impose  .conditions  on  the  number  of  messages  to  be  communicated  in  the  long  run; 
these  conditions  are  necessary  for  agreement  to  be  guaranteed.  Namely,  we  require  that  there  is  an 
indirect  communication  link  from  any  agent  to  any  other  agent  which  is  used  an  infinite  number  of 
times.  This  can  be  made  precise  as  follows: 

Let  A{i)  be  the  set  of  all  agents  that  send  an  infinite  number  of  messages  to  agent  i,  with 
probability  1.  Then,  we  make  the  following  assumption: 

Assumption  3:  There  is  a  sequence  mi, . . .,  m*-t-i  —  m i  of  not  necessarily  distinct  agents  such  that 
m,  £  A(mj+i),  *  =  1,2, ..  .,k.  Each  agent  appears  at  least  once  in  this  sequence. 


The  main  consequence  of  assumption  3,  which  will  be  repeatedly  used,  is  the  following:  If 

{h‘:  1, _ ,  N}  is  a  set  of  numbers  such  that  h‘<hJ,  V?  £  A(i),  Vi,  then  h *  =  hJ,  Vi,  j. 

We  continue  with  a  more  detailed  specification  of  the  operation  of  the  agents.  We  introduce 
assumptions  on  the  knowledge  which  are  directly  related  to  the  properties  of  the  memory  of  agent 
i.  An  agent  may  receive  (at  any  time)  observations  on  the  state  of  the  world  or  receive  tentative 
decisions  (messages)  of  other  agents.  The  knowledge  of  an  agent  at  some  time  will  be  a  subset 
(depending  on  the  properties  of  his  memory)  of  the  total  information  he  has  received  up  to  that 
time.  We  consider  four  alternative  models  of  memory,  formalized  with  the  four  assumptions  that 
follow. 

Let  w'n  be  any  message  received  by  agent  i  at  time  n.  Our  most  general  assumption  requires 
that  w'n  and  are  remembered  at  time  n: 

Assumption  4:  (Imperfect  Memory)  For  all  n,  the  o-field  tfj,  is  such  that  and  uij,  arc  5),* 
measurable. 

Assumption  4  can  be  further  weakened  if  some  restrictions  are  imposed  on  the  communications 
protocol: 

Assumption  5:  (Imperfect  Memory)  For  each  n  there  exists  a  set  I[n)  of  agents  such  that: 

a)  uj,_,  £  <J{,  Vi  £  I(n  —  1),  V/  £  f(n) 

b)  CJ Vi  not  in  f(n). 

Intuitively,  /(»)  is  the  set  of  agents  that  update  their  decision  at  time  n.  Assumption  5  is  satisfied 
by  the  following  two  common  communication  protocols,  provided  that  agent  i  may  obtain  additional 
observations  only  at  times  such  that  i  £  /(n): 

Ring  Protocol:  /(n)  =  {k},  where  k  is  the  unique  integer  such  that  l<k<N  and  k  -f  mN  =  n, 
for  some  integer  m.  Here,  exactly  one  agent  updates  at  any  time  instance  and  communicates  his 
tentative  decision  to  the  next  agent,  and  so  on. 

Star  Protocol:  /(n)  =  {!,.. .,  /V  —  I},  if  n  is  odd;  I(n)  =  {N},  if  n  is  even.  Here  all  agents  but  the 
last  one  update  simultaneously,  communicate  to  the  last  agent  who  updates  and  communicates  to 
all  other  agents  and  so  on. 

Assumption  6:  (Own  Data  Remembered)  Let  G'„  be  the  subfield  of  describing  all  information  that 
has  been  observed  by  agent  i  up  to  time  n,  except  for  the  messages  of  other  agents.  We  assume  that 

cic?' 

With  assumption  8,  we  allow  the  agents  to  forget  the  messages  they  received  in  the  past,  but  they 
are  restricted  to  remember  all  their  past  observations.  In  this  case  the  total  information  available 
to  all  agents  is  preserved. 


7 


Assumption  7:  (Perfect  Memory)  We  let  assumptions  4  and  6  hold  and  assume  that  9),  C  ^'n+v 
Vi,  n. 

Whenever  assumption  7  holds,  we  will  denote  by  7^,  the  smallest  c- field  containing  9'n,  for  all 


We  conclude  this  section  by  defining  a  few  special  cases  of  particular  interest: 

(i)  Estimation  Problem:  We  are  given  a  ’‘-valued  random  vector  x  on  (Q,  If,  SP).  The  objective  is 
to  minimize  the  mean  square  error.  Hence,  the  cost  function  is  c(o)  =  (*  —  v)T{x  —  v),  where  T 
denotes  transpose.  It  is  easy  to  see  that  this  is  a  particular  case  of  a  strictly  convex  function  covered 
by  assumption  2,  with  /t(u>)  being  a  constant. 

(ii)  Static  Linear  Quadratic  Gaussian  Decision  Problem  (LQG):  Let  x  be  an  unknown  random  vec¬ 


tor.  Let  the  sequence  of  transmission  and  reception  times  be  deterministic.  We  assume  that  the 
random  variables  observed  by  the  agents  are  zero  mean  and,  together  with  x,  jointly  normally  dis¬ 
tributed.  We  allow  the  total  number  of  observations  to  be  infinite.  Let  U  —  Rn.  The  objective  is  to  fix 
v  so  as  to  minimize  the  expectation  of  the  quadratic  cost  function  c(v)  —  vTRv-\-xTQv,  with  R  >  0. 
It  follows  that  the  optimal  tentative  decision  of  agent  i  at  time  n  is  uf,  =  G£’[x|cJ,‘J=  E[Gx |5jj, 
where  G  is  a  prccomputable  matrix.  If  we  redefine  the  unknown  vector  x  to  be  equal  to  Gx  instead 
of  x,  we  conclude  that  we  may  restrict  to  estimation  problems,  without  loss  of  generality. 

(iii)  Finite  Probability  Spaces:  Here  we  let  fi  be  a  finite  set.  Then,  there  exist  finitely  many  o-ficlds 
of  subsets  of  n.  Strict  convexity  implies  that  for  each  a-ficld  C  '$  and  any  u;  £  Q  of  positive 
probability  there  exists  a  unique  optimal  tentative  decision.  This  implies  in  turn  that  tentative 
decisions  take  values  in  some  finite  subset  of  U,  with  probability  1.  We  will  therefore  assume,  without 
loss  of  generality,  that  U  is  a  finite  set. 
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In  this  section  we  state  and  discuss  our  main  results.  All  proofs  can  be  found  in  the  Appendix. 
Assumptions  2  and  3  will  be  assumed  throughout  the  rest  of  the  paper  and  will  not  be  explicitly 
mentioned  in  the  statement  of  each  theorem.  We  start  with  the  least  Restrictive  assumptions  on 
memory: 

Theorem  1:  We  assume  that  transmissions  and  receptions  are  deterministic,  that  communication 
delays  are  bounded  and  that  the  time  between  two  consecutive  transmissions  from  agent  j  to  agent 
t  (with  j  €  A(i))  is  bounded.  Then,  under  assumptions  12  (convex  costs)  and  either  assumption  4 
or  5  (imperfect  memory): 

a)  limn_00(u,ri_j_1  —  =  0,  in  probability  and  in  7,/x). 

b)  limn-^o(u),  —  u£)  —  0,  Vi,  j,  in  probability  and  in  L^ft,  7,/i). 

Consider  the  following  situation:  At  time  zero,  before  any  observations  are  obtained,  the  se¬ 
quence  of  transmissions  and  receptions  is  selected  in  random,  according  to  a  statistical  law  which  is 
independent  from  all  observations  to  be  obtained  in  the  future  and  from  c( v),  for  any  v  £  U.  In  other 
words,  communications  do  not  carry  any  information  relevant  to  the  decision  problem,  other  than 
the  content  of  the  message  being  communicated.  Suppose  that  the  sequence  of  communications  that 
has  been  selected  becomes  known  to  all  agents.  From  that  point  on,  the  situation  is  identical  with 
that  of  deterministic  communications.  In  fact,  a  moment’s  thought  will  show  that  it  is  sufficient  for 
the  history  of  communications  to  become  commonly  known  as  it  occurs:  agent  t  only  needs  to  know, 
at  time  n,  what  communications  have  occured  up  to  that  time,  so  that  he  can  interpret  correctly 
the  meaning  of  the  messages  he  is  receiving. 

We  can  formalize  these  ideas  as  follows:  We  are  given  a  product  probability  space  (ft  X  ft*,  9  X 
7*,  7  x  where  (ft,  7,  7)  describes  the  decision  problem  and  where  (ft*,  7*,  9*)  describes  the 

communications  process.  We  assume  that  for  each  u/*  £  ft*,  the  resulting  process  of  communications 
satisfies  the  assumptions  of  theorem  1.  Then,  note  that  each  u>*  £  ft*,  we  obtain  a  distributed 
decision  problem  on  (ft,  7, 7)  with  deterministic  communications.  In  that  case: 

Theorem  2:  Under  assumptions  1.2,  either  4  or  5,  and  independent,  commonly  known  communica¬ 
tions  (as  described  above),  limn_oc(u,n+1  —  u‘„)  =  limn_ao(u‘n— uf,)  =  0,  in  probability  with  respect . 
to  7  X  7*. 


Strictly  speaking,  Theorems  1  and  2  do  not  guarantee  convergence  of  the  decisions  of  each  agent. 
Suppose,  however,  that  the  agents  operate  under  the  following  rule:  Fix  some  small  7  >  0.  Let  the 
sequence  of  communications  and  updates  of  tentative  decisions  take  place  until  \u'n  —  u£|  <  7,  Vi,  jf 
(small  disagreement)  and  |u[l+1  —  u'J  <  7,  Vi  (small  foreseeable  changes  in  tentative  decisions). 
Then,  we  obtain: 

Corollary  1:  With  the  above  rule  and  the  assumptions  of  Theorems  1  or  2,  the  process  terminates 
in  finite  time,  with  probability  l,  for  any  7  >  0. 

When  n  and  IJ  are  finite,  convergence  and  agreement  are  obtained  after  finitely  many  stages: 

Theorem  3:  If  0  and  U  are  finite  sets,  if  each  agent  communicates  all  the  values  of  v  that  minimize 
£'[c(v)|'7fl]  and  if  assumption  4  holds,  then  there  exists  some  positive  integer  M  such  that 

=  uil>  and  =  u!vf>  Vt,  Vn,  Vui  c  n. 

Strictly  speaking,  tentative  decisions  in  the  above  theorem  are  not  elements  of  U  but  subsets  of 
U.  This  is  to  compensate  for  the  possibility  of  non-uniqueness  of  optimal  tentative  decisions.  The 
equalities  appearing  in  Theorem  3  have  to  be  interpreted,  therefore,  as  equalities  of  sets. 

We  now  assume  that  the  agents  have  perfect  memory.  We  obtain  results  similar  to  theorems  I 
and  2  under  much  more  relaxed  assumptions  on  the  communications  process.  Namely,  we  only  need 
to  assume  the  following: 

Assumption  8:  Let  be  the  fc-th  message  sent  by  agent  j  to  agent  i.  We  assume  that  when  agent 
i  receives  MlJ,  he  knows  that  this  is  indeed  the  fc-th  message  sent  to  him  by  agent  j. 

Remark:  This  assumption  is  trivially  satisfied  if  messages  arrive  at  exactly  the  same  order  as  they 
are  sent,  with  probability  1. 

Theorem  4:  Under  assumptions  1.2  (convex  costs),  7  (perfect  memory)  and  8,  there  exists  a  £/-valued 
random  variable  u*  such  that  limn— 00  u),  =  u*,  Vi,  in  probability  and  in  Li(Q, 9, p). 

For  estimation  problems  (uj,  =  E[z|9)J),  theorem  4  can  be  slightly  strengthened:  (5,  Theorem  2] 

Theorem  5:  For  estimation  problems,  under  the  assumptions  of  theorem  4,  convergence  to  u*  takes 
place  with  probability  1. 
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We  now  consider  the  case  where  U  is  finite  but  (unlike  theorem  3)  0  is  allowed  to  be  infinite. 
Several  complications  may  arise,  all  of  them  due  to  the  fact  that  optimal  decisions,  given  some 
information,  are  not  guaranteed  to  be  unique.  We  discuss  these  issues  briefly,  in  order  to  motivate 
the  next  theorem. 

Suppose  that  U  =  {«i,t>2}.  It  is  conceivable  that  £[c(t>[)|9jj  —  £[e(v2)|^]J  is  never  zero  and 
changes  sign  an  infinite  number  of  times,  on  a  set  of  positive  probability.  In  that  case,  the  decisions  of 
agent  t  do  not  convege.  Even  worse,  it  is  conceivable  that  £7[c(wi)|^FJJ  >  £'(c(t>2)|*FJJ  and  £[c(t/i)|®ffi]  < 
£[c(u2)|3F£]  ,  for  all  n  and  for  all  a;  in  a  set  of  positive  probability,  in  which  case  agents  i  and  j  disagree 
forever.  It  is  not  hard  to  show  that  in  both  of  the  above  cases  •E,[e(t>i)|9Jc]  =  2?[c(t>2)|5£0],  on  a  set  of 
positive  probability  and  this  non-uniqueness  is  the  source  of  the  pathology.  The  following  theorem 
states  that  convergence  and  agreement  are  still  obtained,  provided  that  we  explicitly  exclude  the 
possibility  of  non-uniqueness. 

Theorem  6:  Under  assumptions  1.1  (finite  U)  and  7  (perfect  memory)  and  ir  the  random  variable  u* 
that  minimizes  £[c(w)]  over  all  ^immeasurable  random  variables  is  unique  up  to  a  set  of  measure 
zero,  for  all  i,  then  limn_o0u]l  =  u‘,  almost  surely,  and  u*  =  «*,  V*,j. 

Although  the  preceding  theorems  guarantee  that  (under  certain  conditions)  all  agents  will  agree, 
nothing  has  been  said  concerning  the  particular  decision  to  which  all  agents’  decisions  converge.  In 
particular,  it  is  not  necessarily  true,  as  one  would  be  tempted  to  conjecture,  that  the  limit  decision  is 
the  optimal  centralized  solution  (that  is,  the  solution  to  be  obtained  if  all  agents  were  to  communicate 
all  their  information).  On  the  other  hand,  the  centralized  solution  is  reached  for  LQG  problems, 
under  the  perfect  memory  assumption  [6]  and  is  also  reached  gencrically  for  an  estimation  problem 
on  a  finite  probability  space  [7],  This  issue  will  be  touched  again  in  the  next  section. 

Robustness  with  respect  to  Communication  Noise. 


Schemes  that  centralize  information  by  coding  (e.g.  by  using  the  least  significant  bits  of  the  al¬ 
lowed  messages  [1,13]  tend  to  require  high  bandwidth  and  are  sensitive  to  noise  in  the  communication 
channel.  In  our  scheme,  although  real  numbers  are  being  transmitted  (infinite  information  content), 
the  least  significant  bits  are  not  as  essential.  As  a  result,  the  qualitative  convergence  properties  of 
our  scheme  are  retained  even  if  communications  of  the  tentative  decisions  arc  assumed  to  be  noisy.  . 
We  provide  a  proof  of  this  fact  for  estimation  problems,  under  the  perfect  memory  assumption. 
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Suppose,  as  before,  that  at  random  times  agent  j  communuicatcs  his  optimal  tentative  decision 
<  However,  the  message  received  by  the  other  agents  is  uJn  =  uf,  +  where  ^  is  a  random 
vector  representing  the  noise  in  the  channel.  For  simplicity,  we  assume  that  the  noise  vectors  are 
independent,  identically  distributed. 
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Theorem  7:  Assume  noisy  communications  (as  described  above).  For  estimation  problems,  under 
assumption  7  (perfect  memory),  there  exists  a  [/-valued  random  variable  u*  such  that  lim,,-**!**  = 
u*,  Vt,  with  probability  1. 
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IV.  THE  LINEAR  QUADRATIC  GAUSSIAN  (LQG)  MODEL. 


In  this  section  we  specialize  and  strengthen  some  of  our  results  by  restricting  to  the  Linear  Quadratic 
Gaussian  model  described  in  section  2.  (Recall  that  any  such  problem  is  equivalent  to  an  estimation 
problem;  therefore,  tij,  =  £"  =  £[z|9)J,  for  some  random  vector  x.)  Theorems  1,  3  and  4  are 
applicable.  Moreover,  the  results  of  (6]  guarantee  that,  under  assumption  7  (perfect  memory),  uj, 
converges  to  the  optimal  centralized  estimate,  given  the  information  possessed  by  all  agents.  The 
following  theorem  states  that  the  same  is  true  under  the  weaker  assumption  6. 

Theorem  8:  For  the  LQG  problem,  under  the  assumptions  of  Theorem  1  and  assumption  6  (imperfect 
memory;  own  data  remembered),  lim„_ao  x'n  —  x,  in  the  mean  square,  where  x  =  £{z|'J0O]  and  *foo 
is  the  smallest  cr-ficld  containing  ?)„  for  all  i,  n. 

Note  that  theorem  8  is  much  stronger  than  theorem  1  which  was  proved  for  the  general  case  of 
imperfect  memory.  We  have  here  convergence  to  a  limit  solution  which  is  also  guaranteed  to  be  the 
optimal  centralized  solution. 

Our  next  result  concerns  the  finite  dimensional  LQG  problem  in  which  the  total  number  of 
observations  is  finite.  Namely,  the  smallest  <7-field  containing  for  all  i,  n  is  generated  by  a  finite 
number  of  (jointly  Gaussian)  random  variables.  In  that  case,  the  centralized  solution  is  going  to  be 
reached  by  all  agents  in  a  finite  number  of  stages,  provided  that  all  agents  have  perfect  memory. 

Theorem  9:  For  the  LQG  problem  with  finitely  many  observations  and  under  assumption  7  (perfect 
memory),  the  centralized  solution  is  reached  by  all  agents  in  a  finite  number  of  stages. 

Theorems  8  and  9  imply  that  the  scheme  considered  in  this  paper  may  be  viewed  as  an  algorithm 
for  solving  static  linear  estimation  problems,  an  issue  that  we  discuss  below. 

The  intuitive  argument  behind  Theorem  9  is  the  following:  once  an  agent  has  received  enough 
messages,  he  is  able  to  infer  exactly  the  values  of  the  observations  of  the  other  agents  (or  of  some 
appropriate  linear  combinations  of  these  observations)  and  compute  the  centralized  solution  himself. 
So,  communicating  optimal  tentative  decisions  is  in  this  case  just  another  way  for  communicating  all 
information  to  all  other  agents.  This  scheme  does  not  seem  to  have  any  particular  advantages  (in  terms 
of  communication  and  computation  requirements)  over  the  scheme  where  each  agent  communicates 
all  his  data  directly. 

However,  the  scheme  of  Theorem  8  (imperfect  memory)  seems  to  have  some  appealing  features, 
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as  we  indicate  below.  Suppose  that  we  have  a  single  processor  who  obtains  a  NM -dimensional 
vector  of  observations.  He  then  divides  his  observations  into  N  M -dimensional  vectors  that  will 
play  the  role  of  the  agents  of  our  scheme.  Finally,  the  processor,  instead  of  inverting  the  NM  X  NM 
covariance  matrix  to  obtain  the  optimal  estimate  (which  would  require  0(/V3A/3)  operations),  he 
uses  the  scheme  of  Theorem  8.  At  each  round  there  will  be  one  inversion  per  block  of  data,  that 
is  0(NM 3)  operations  per  round.  If,  for  example,  an  acceptable  estimate  is  obtained  after  O(N) 
rounds,  the  final  objective  will  have  been  accomplished  with  a  total  of  0[N2MJ)  operations,  which 
is  one  order  of  magnitude  less  than  the  usual  algorithm.  It  is  not  hard  to  show  that  if  the  noises  in 
observations  belonging  to  different  blocks  are  uncorrelated,  agreement  is  obtained  after  two  rounds 
only.  Accordingly,  if  the  noises  in  observations  in  different  blocks  are  weakly  correlated,  we  expect 
our  scheme  to  be  faster  than  the  standard  algorithm.  We  present  below  some  numerical  results  that 
support  the  above  statements.  So,  our  scheme  leads  to  a  potentially  advantageous  decomposition 
algorithm  for  static  linear  estimation  problems.  (This  algorithm  has  some  conceptual  similarities 
with  those  suggested  in  [8].) 

We  now  discuss  some  issues  related  to  the  distributed  implementation  of  the  decomposition  al- 
gorithm,  where  each  block  of  data  actually  corresponds  to  a  physically  distinct  agent  (processor).  For 
any  t,  n,  x'n  —  a'ntj,  where  aj,  is  a  row  vector  and  y  is  the  vector  of  all  available  observations.  When 
agent  j  receives  zj,,  he  must  also  learn  aj,,  in  order  to  be  able  to  extract  information  from  iln.  There 
are  two  choices:  Either  a)  agent  j  computes  a)„  which  may  be  done  off -line,  or,  b)  agent  t  transmits 
aj,  to  agent  j.  Which  of  the  two  should  be  done  clearly  depends  on  whether  communications  or 
computations  are  more  costly.  Whether  one  of  the  above  two  variations  can  be  useful  depends  on  the 
particularities  of  the  actual  situation  and  its  inherent  communication  and  computation  limitations. 
More  numerical  experience  is  needed  before  a  definite  answer  can  be  given. 

Numerical  Results. 

Let  z  be  an  unknown  scalar,  zero  mean,  random  variable  to  be  estimated  (£7[x2]  =  5).  Let 
j/i  =  x  -f  tt’„  (i  =1,.. .,18)  be  the  observations.  The  noises  w,  are  assumed  to  be  independent  of  z. 
(The  covariance  Eu,  of  the  noises  was  randomly  generated.)  We  split  the  18-dimensional  observation 
vector  into  blocks  of  data  (corresponding  to  distinct  agents)  and  used  the  decomposition  algorithm 
of  Theorem  8.  We  employed  the  ring  protocol  and  assumed  that  at  each  stage  an  agent  only  knows 
his  own  observations  and  the  most  recent  message  he  received  (Assumptions  5,  6). 

Let  Mi  be  the  number  of  observations  assigned  to  agent  t.  We  considered  two  alternative 
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decompositions:  (i)  N  =  2,  M\  =  10,  Mi  —  8;  (ii)  N  =  6,  Mi  =  . . .  =  A4q  =  3.  We  first  executed 
the  algorithm  using  the  covariance  and,  then,  once  more  using  the  covariance  -f*  I- 

The  results  are  presented  in  Figures  1,  2.  The  horizontal  axis  denotes  stages  (each  stage  cor¬ 
responds  to  an  update  by  some  agent)  and  the  vertical  axis  indicates  the  associated  mean  square 
error.  The  dotted  horizontal  line  indicates  the  centralized  mean  square  error.  The  curves  Dl  and 
D2  correspond  to  the  first  and  second  decomposition,  respectively.  As  expected,  convergence  was 
much  faster  when  the  identity  was  added  to  the  initial  covariance;  moreover  the  first  decomposition 
converged  much  faster  than  the  second. 

To  illustrate  the  merits  of  the  decomposition  algorithm  we  performed  a  rough  Count  of  opera¬ 
tions.  We  only  took  matrix  inversions  into  account,  assuming  that  the  inversion  of  a  M  X  M  matrix 
requires  M3  operations,  which  is  accurate  enough  for  our  purposes.  With  this  counting  scheme, 
the  centralized  algorithm  required  5832  operations.  The  points  A,  B  in  the  graphs  were  reached 
after  1100,  1152  operations,  respectively.  This  leads  to  the  following  conclusion:  While  the  first 
decomposition  needs  very  few  stages  to  converge,  it  does  not  have  any  particular  computational 
advantages.  The  second  decomposition,  however,  leads  to  an  estimate  close  to  the  optimal  with 
much  fewer  operations  than  the  centralized  algorithm. 
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V.  A  MODEL  INVOLVING  A  COORDINATOR, 

In  the  previou*  sections  we  had  assumed  that  for  any  pair  of  agents  i,  j,  agent  t  is  allowed  to 
communicate  to  j.  In  this  section  we  assume  that  a  particular  agent  (denoted  by  the  superscript 
o)  has  special  status  and  acts  like  a  coordinator.  The  scheme  we  envisage  is  the  following:  At  each 
instance  of  time  n,  agent  t  evaluates  u'n  which  he  communicates  to  the  coordinator.  The  coordinator 
then  combines  uj,  to  to  produce  a  tentative  decision  u“ .  We  assume  that  the  coordinator  has  no 
data  of  his  own.)  He  then  transmits  u®  to  all  other  agents  which  accordingly  update  their  decisions. 
Were  the  coordinator  to  cor  .bine  uj,  to  “optimally”,  the  above  scheme  would  reduce  to  the  one 
of  the  previous  sections  and  our  past  results  would  apply.  We  assume,  however,  that  the  coordinator 
simply  sets 


s 


i—i 


where  the  coefficients  a*  are  deterministic,  positive  and  o'  —  1.  The  implicit  behavioral  as¬ 
sumptions  are:  (i)  The  coordinator  has  no  memory  and  (ii)  he  need  not  have  a  good  knowledge  of 
the  problem.  He  only  knows  how  much  he  can  rely  on  each  of  the  other  agents;  this  is  reflected  by 
his  choice  of  the  coefficients  a'  which  may  be  thought  of  as  a  “reliability  index”  for  agent  i  in  the 
eyes  of  the  coordinator.  We  then  obtain: 

Theorem  10:  The  conclusions  of  theorems  I,  4,  8,  remain  true  (under  their  respective  assumptions) 
with  the  scheme  introduced  in  this  section. 

The  above  scheme  can  be  viewed  as  a  framework  for  cooperation,  where  the  coordinator  simply 
aids  the  agents;  or,  for  LQG  problems,  as  another  decomposition  algorithm.  It  can  be  also  inter¬ 
preted,  however,  from  an  entirely  different  point  of  view:  Suppose  that  the  agents  are  selfish  and 
independent  individuals,  faced  with  identical  situations,  possessing  different  information  and  having 
to  make  repetitive  decisions.  They  can  certainly  benefit  by  observing  past  decisions  of  the  other 
agents  but  assume  that  this  is  not  possible.  They  are  able,  however,  to  observe  a  weighted  average 
u°  of  all  decisions  made  in  the  last  stage,  which  they  take  into  account  for  their  future  actions.  The 
motivation  for  such  a  model  comes  primarily  from  economics:  Each  agent  is  a  buyer  (or  seller)  in 
the  same  market  and  at  each  stage  he  obtains  some  aggregate  information  (e.g.  the  average  price) 
on  the  transactions  that  were  made  in  the  previous  stage.  In  this  sense  the  “coordinator”  simply 
represents  a  market  mechanism.  Our  results  state  that,  eventually,  an  informational  equilibrium  wilt 
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be  reached.  Such  an  equilibrium  has  been  studied  by  Radner  [12]  in  a  different  setting.  However, 
there  was  no  demonstration  of  an  adjustment  process  that  could  lead  to  such  an  equilibrium.  Our 
scheme  provides  a  model  of  rational  behavior  which,  if  followed  by  each  agent,  leads  to  equilibrium. 

Moreover,  within  such  a  context  (of  selfish  individuals  confronted  with  identical  situations)  and 
for  LQG  problems  with  perfect  memory,  optimal  tentative  decisions  constitute  a  set  of  strategies 
in  Nash  equilibrium  for  a  certain  game.  (This  is  why  optimal  tentative  decisions  can  be  called  “a 
model  of  rational  behavior”.)  Let  us  define  the  game  of  interest  more  precisely. 

Let  y'n  be  a  vector  of  jointly  Gaussian  random  variables  that  generate  G^  the  o-algebra  of 
events  known  to  agent  t  at  time  n  if  he  had  received  no  messages.  At  each  stage,  agent  t  selects  a 
decision  uj,  and  incurs  (but  does  not  observe)  a  cost  anc(ti](),  where  0  <  a  <  1  and  e  is  a  quadratic 
cost  function.  Then,  u°n  =  Ea'uJ,  is  formed  and  communicated  to  all  agents.  The  total  cost  to  agent 
i  is  /*  —  E^_lanE,[<^u,*l)].  A  strategy  for  agent  i  is  a  sequence  {-yj,,  i  *=  1,2, ...}  of  measurable 

functions  such  that  u*„  =  7],(yJ,,  u°, . . .,  u“_t).  Let  T  =  {P:  t  =  1 . N)  denote  the  particular  set 

of  strategies  where  each  agent  at  each  stage  plays  his  optimal  tentative  decision.  (Note  that  these 
are  linear  strategies.)  Then, 

Theorem  11:  T  is  a  set  of  strategies  in  Nash  equilibrium. 
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A  set  of  agents  with  the  same  objective  who  start  communicating  to  each  other  their  tentative 
optimal  decisions  are  guaranteed  to  agree  in  the  limit.  Under  certain  assumptions,  this  is  true  even 
if  the  decisions  are  received  in  the  presence  of  noise  and  even  if  their  memory  is  limited  and  they  are 
allowed  to  forget  some  of  their  past  knowledge.  Moreover,  they  are  guaranteed  to  converge  to  the 
optimal  centralized  decision  for  linear  estimation  problems,  provided  that  they  do  not  forget  their 
own  observations.  This  leads  to  a  decomposition  algorithm  for  static  linear  estimation  problems. 
Similar  results  are  obtained  if  the  agents  do  not  communicate  directly  but  receive  messages  from  a 
coordinator  who  evaluates  a  weighted  average  of  all  tentative  decisions.  In  the  latter  framework,  for 
LQG  problems  with  perfect  memeory,  optimal  tentative  decisions  are  Nash  strategies  for  a  certain 
sequential  game  and  admit  an  economic  interpretation. 

These  results  are  valid  when  all  agents  share  the  same  model  of  the  world.  The  characterization 
of  the  behavior  of  agents  with  different  models  (perceptions)  is  an  open  problem. 
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Proof  of  Lemma  1:  Let,  for  notational  convenience,  g(w,  t/)  =  £(c(t>)|9JJ.  Under  assumptions  1.2 
and  2,  g(u,  v)  can  be  chosen  so  that  it  is  jointly  measurable,  strictly  convex  in  v  and  converges  to 
infinity  as  v  converges  to  infinity,  for  any  ugQ.  Hence,  Vw  € fl,  the  infimum  of  g(w, v)  is  attained 
by  some  u|,(w)  which  is  unique,  because  of  strict  convexity. 

Let  Q  —  be  a  countable  dense  subset  of  U.  Then,  by  continuity  of  g,  inf g(u»,  u) 

=  inf„eo  v)>  Vui  €  ft-  Moreover,  inf„e<j  g(ui,  v)  is  “/^-measurable.  Let  =  qkl  where  k  is 

the  smallest  index  such  that 

®(«*  <7fc)<  inf  g(w,t/)+  -J-. 

Then  4>,n  is  ^-measurable  and  converges,  for  each  u»  to  u*n.  Hence,  u*n  is  ®F’f, -measurable.  Inequality 
(3)  now  follows  from  the  definition  of  u‘n  and  uniqueness  is  a  consequence  of  strict  convexity.  The 
measurability  of  u),  under  asumption  1.1  is  trivial.  | 

Lemma  2:  Let  {un},  {w,,}  be  two  sequences  of  [/-valued  random  variables  such  that 

lim  =  ,i,n  £fc(“<»)]  =  lim  £[c(u>r,)].  (4) 

n— oo  L  \  ^  / J  n— oo  n— oo 


If  assumptions  1.2  and  2  hold,  then  Iim„_w>(un  —  w„)  —  0  in  L2(Sl,9,n)  and  in  probability. 
Proof:  By  assumptions  1.2,  2  and  equation  (4), 


lim  E(/l(w)||u„  —  lim  E\ 


<K«n)  +  c(u/„) 

/u„-f  wn\ 

2 

\  2  j\ 

(5) 


which  shows  that  (un  —  wn)  converges  to  zero  in  L^ft,  9,  n).  Therefore,  it  also  converges  in  measure 
with  respect  to  fi. 

Recall  that  /l(w)  >  0  and  n(B)  —  faA(uj)  d9(ut),  VB  €  9.  Therefore,  n(B)  =  0  implies  9(B)  —  0 
and  9  is  absolutely  continuous  with  respect  to  fi.  Let  &n  —  {!«„  —  to„|  >  e}.  Since  (u„  —  w„)  con¬ 
verges  to  zero  in  measure  n,  for  any  e  >  0,  we  have  limn_ooA*(^,)  =  0  and,  by  absolute  continuity, 
lim„— oo^CB*,,)  38  0,  which  shows  that  we  have  convergence  in  probability.  | 


Proof  of  Theorem  1:  We  start  with  the  proof  under  assumption  4.  Since  uj,  is  7*rt+1-mcasurable,  we 
have  (by  the  minimizing  property  of  u»»-h)  £7(c( ! )} <JE7 I^-u* ,)) .  Since  c  is  nonnegative,  £[c(tt'n)] 


converges  to  some  constant  gl.  We  also  note  that  (u‘n+1  -f*  uJJ/2  is  9\+  (-measurable  and  by  taking 
the  limit  in  the  relation 

S  >  EW<*Ul)l 

wc  obtain  limn_*jo£(c((u{,+l  +  u‘„)/2) ]  =  gx.  Lemma  2  then  yields  the  first  part  of  the  theorem. 

Let  j  €  A[i).  Then  there  exist  sequences  {m*,}  and  {nt}  of  positive  integers  such  that 
limfc_oom*  —  limjj-^o  =  oo  and  m^,  are  the  times  of  transmission  and  reception,  respectively, 
of  the  &-th  message  from  agent  j  to  agent  Therefore,  is  5;^- measurable,  for  ail  k,  and 
i57(c(t<Jtfc)J<£7[c(«'/„fc)l  which  shows  that  g'<gi.  Using  assumption  3,  we  conclude  that  g‘  ~  g*,  Vi,  j. 
We  note  that  (tin*  +  u^J/2  is  Tn|j- measurable  and,  therefore, 

{m<)  +  c«)]  >  y^')]  -  ^Ol  (6) 

Taking  the  limit  in  (6},  using  Lemma  2  and  the  boundedness  assumptions  on  the  communications, 
wc  obtain  the  second  half  of  the  theorem. 

We  now  assume  assumption  5.  Let  t(n)  be  a  sequence  of  agents  such  that  i(n)  €E  /(«),  Vn.  Then, 
uj(m)  js  'f^'fl-measurable  lind  £7[c(i*J^ ,,))]  is  a  decreasing  sequence.  Similarly  with  the  first  part  of 
the  proof,  we  conclude  that  ^  converges  to  zero  in  L^Q,  9,  /*)  and  in  probability.  It 

follows  that  —  u‘n  and  it),  —  u-J,  converge  to  zero,  for  j  6  A{i).  Using  assumption  3,  uln  —  uj, 
converges  to  zero,  for  all  i,  j.  | 

Proof  of  Theorem  2:  Theorem  1  and  the  discussion  preceding  the  statement  of  theorem  2  show  that 
lim„~,oo(u;i_H  —  u),)  =  0,  in  probability  with  respect  to  9,  for  all  u/*  S  ft*.  Let  Xn(u>,  ui*)  be  the 
characteristic  function  of  the  set  lju^l  —  u,'J|  <  e}.  Then, 

lim  [ Xnd{' 9x9*)  =  lim  f  j  x„d*Pd?P*  =  /  lim  [  Xn^9d9*  =  1 

n— *oo  J  n— •  oo  J  J  J  n— *oo  J 

(The  first  equality  holds  by  the  Fubini  theorem;  the  second  by  the  dominated  convergence  theorem; 
the  third  by  convergence  of  (uj,+l  —  u*n)  to  zero,  with  respect  to  the  probability  measure  9.)  This 
shows  that  u’n^_l  —  u‘„  converges  to  zero  in  probability  with  respect  to  9  X  9*.  Similar  steps  show  . 
that  u‘n  —  u(,  also  converges  to  zero,  in  probability.  | 
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Proof  of  Corollary  1:  By  theorem  1,  the  U  X  [/-valued  sequence  of  random  variables  (u|,— u‘n+l  — 
u‘n)  converges  to  (0, 0)  in  probability.  It  therefore  contains  a  subsequence  converging  to  (0, 0),  almost 
surely.  Therefore,  V7  >  0  and  for  almost  all  w  £  ft,  3n„  such  that  luj^  — u£j  <  7,  |uj^+l  —  ujj  <  7 
aiul  the  termination  condition  is  eventually  satisfied  with  probability  one.  | 

Proof  of  Theorem  3:  Because  of  the  finiteness  of  0,  there  exists  a  finite  (non-random)  time  after 
which  communications  (conditioned  on  past  events)  are  deterministic.  We  may  take  that  time  as 
the  initial  time  and  assume,  without  loss  of  generality,  that  all  communications  arc  deterministic. 

Let  u'n  be  the  set  of  elements  of  U  which  are  optima',  given  Tn.  Let  to),  denote  a  «Fj,- measurable 
random  variable  such  that  w'n(w)  £  uj,(w),  Vw  £  ft.  (Note  that  E\c{w\^\  is  independent  of  how 
rv\  has  been  selected.)  By  finiteness  of  ft  and  U,  there  exist  finitely  many  [/-valued  random  vari¬ 
ables  and,  since  ^(c(tw;44.1)]<E[c(ti>,ra)],  we  conclude  that  there  exists  some  positive  integer  T  and 
some  g{  such  that  E(c(u;;j)  =  g{,  Vn  >  T.  For  any  n  >  T,  £(c(u/;,)]  =  E[c(u;^+1)l  and  since 
w',i is  ^measurable,  w‘n  minimizes  £fc(u>)l  over  all  «F‘n+I-measurable  random  variables.  Hence, 
u>|,(w)  £  u;i+l(w),  Vuj  £  ft  which  shows  that  uf‘,(u/)  C  u),+l(w),  Vw  £  ft.  Again,  by  fmiteness  of  U 
and  ft,  there  exists  some  positive  integer  M  such  that  uJH.,(u>)  —  uj,(ai),  Vn  >  M,  Vw  £  ft,  Vi. 

If  j  £  A(t),  there  exist  m,  n  >  M  such  that  w-jn  is  “J^-measurable  and  this  shows  that  p/<p». 
By  assumption  3,  we  obtain  g‘  —  gJ,  Vi ,j.  Therefore,  wJm  minimizes  £[c(u;)]  over  all  ^-measurable 
random  variables  and,  therefore,  wjrl( w)  £  u|,(ui),  or,  ujn[u)  C  u^(ai),  Vw  £  ft.  Recalling  assumption 
3,  we  obtain  uJ„ ,(w)  —  u;,(w),  Vi,  j,  Vm,  n  >  M,  Vw.  | 

Lecpma  3-  Let  T  be  a  finite  stopping  time  of  an  increasing  family  {CJ„}  of  o-ficlds.  Let  u,„  n  = 
1,  2, . . .  be  random  variables  that  minimize  £'[c(u/)|^f„],  almost  surely,  over  all  ^-measurable  random 
variables  w.  Then,  uj  minimzes  £*[c(tc;)]  over  all  57-measurable  random  variables  w,  where  ut  is 
defined  to  be  equal  to  u„  if  and  only  if  T  =  n. 

Propfr  Let  Xn  b®  the  indicator  function  of  the  set  (w:  T[w)  =  n).  Since  T  is  a  stopping  time,  Xn  is 
‘Jn-measurablc.  Note  that  Xn^Un)  =  X»c(ut)-  Let  w  be  a  '?7^measurable  random  variable  and  note 
that  XtAw)  =  Xn^Xu*")  an<l  Xn10  is  9,1-measurable.  Therefore, 

^[XnC^JI^n]  =  Xij£[c(Xnu,)l^n]  ^  Xn^(c(un)|^n]  =  £[Xwc(un)|^n)  ==  £'[Xiic(ur)|'Tn) 

Taking  expectations,  we  obtain 

£(XnCM]  >  £T(XnC(ur)] 
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and  summing  over  all  n’s  (and  using  the  monotone  convergence  theorem  to  interchange  summation 
and  expectation)  we  obtain  Ef^u/)]  >  E(c(ur)}.  | 

Proof  of  Theorem  4;  Since  u*„  is  !7)l+1-measurable,  we  have  Ejc(u‘t+1))<E{c(u|t)).  Since  c  is  non¬ 
negative,  E(c(u|,)j  converges  to  some  constant  g'.  We  also  note  that  (u),  +  un-|-in)/2  *8  ^h+m~ 
measurable.  Therefore,  £(c((ti,‘,+fn  +  uJt)/2)J  >  E[c(uJ,+m))  >  gl.  Fix  some  e  >  0,  and  let  n  be  large 
enough  so  that  £[c(u), )]<<;'  -j-e.  Then,  using  assumption  3,  we  obtain  E(/\(w)||u)1+„,  —  «}i||,]<e, 
Vm  >  0.  Therefore,  {ttj,}  is  a  Cauchy  sequence  in  Z/2(fl,  9,  m)-  By  the  completeness  of  Lj  spaces, 
there  exists  a  t/-valued  random  variable  it*  such  that  limn^oouj,  =  u‘,  in  Lri(n,1 F,jt)  and,  therefore, 
in  probability,  with  respect  to  9.  (The  proof  of  the  last  implication  is  contained  in  the  proof  of 
Lemma  2.)  Since 


EiBW<+,)i»t+,]py<EiBW«i)in+iiw = em-upu. 

{£,|c(u|J|‘Jj1],  n  —  1, 2, . . .}  is  a  supermartingale,  with  respect  to  (5),}. 

Moreover,  since  for  any  fixed  v  £  t/,  £{c(u}J|‘J,n]<E[c(u)|<3:|J,  it  is  a  uniformly  intcgrable 
supermartingale  (10,  Theorem  T19,  p.90|. 

Let  j  £  /t(i).  Let  mi-,  nk  be  the  times  of  transmission  and  reception,  respectively,  of  the  k- 
th  message  from  j  to  i.  Because  of  assumption  8,  m k  and  nk  arc  stopping  times  of  {CJ7},  {GP,l}, 
respectively,  and  since  j  £  /l(t),  they  are  almost  surely  finite  stopping  times,  for  all  k.  Moreover, 
/c<mfc<nt  and  by  the  optional  sampling  theorem  [10,  Theorem  T28,  p.90] 


E[c{4)  }>E[c{niJ\>gi 


which  shows  that  linrtjt— ooEWu}^)]  =  gl.  Similarly,  limfc_oo£Ic(  «?,,*)]  =  g 7. 

Note  that  uJmk  is  5'nji-measurable  and,  by  Lemma  3,  E[c(u^,J]  >  E[c(uJlk)J.  Taking  the  limit, 
we  obtain  g7  >  gl,  and  by  assumption  3,  g'  =  g7  =  g,  Vi,j. 

We  now  take  the  limit  of  the  inequalities 


2£(C(U^)  +  CK 


m*/ 


>£[e(“.k±<j 


>E\c(u\J 


to  obtain  limt-00£,(c((uJiji  4.  u;,  J/2)]  =  g  and,  by  Lemma  2,  Iim*-*,^  —  u7  J  —  0,  in  L>(Cl,  . 
and  in  probability. 
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We  also  take  the  limit  of  the  inequalities 


2E(cKJ  +  <=«•)] 


>  ^[c(uL)j 


to  obtain  limbec  £(c((<4  4-  tt‘nk)/2)]  =  0*  and,  by  Lemma  2,  limfc-^cful,;  —  u'k)  =  0.  Similarly,  we 
obtain  limjt_*0(u4,lt  —  u{)  =  0,  which  shows  that  u*  =  ttJ,  almost  surely.  | 


Proof  of  Theorem  6:  Fix  some  v£U  and  let  B  =  {uj:  u'(uj)  =  v}.  Then,  £[c(u)|9JJ  <  £[c(v*)|9jj, 
Vu*  g  U,  for  almost  all  u i  SB.  By  the  ir.'rtingale  convergence  theorem  [10,  Theorem  T17,  p.84j, 
we  conclude  that,  for  almost  all  u  g  B,  there  exists  some  N(oj)  such  that 


E[c(v)\^J  <  £[c{v*)!Sjj  Vn  >  N(u) 


Therefore,  lim„_ao  uj,(w)  =  v,  for  almost  all  w  g  B  and,  by  considering  the  other  elements  of  U  as 
well,  limn-.ou  uj,  —  u‘,  almost  surely. 

If  j  £  A(i),  uJ  is  9^-measurable  and  £(c(uJ)]  >  £[c(u‘)].  By  assumption  3,  £[c(u*)|  =  £[c(u7)], 
Vt,  j.  Therefore,  for  j  g  A(i),  uJ  minimizes  £(c(u>)]  over  all  ^-measurable  random  variables  and  by 
the  assumptions  of  the  theorem,  u‘  =  uj,  almost  surely.  Using  assumption  3  once  more,  we  obtain 
u*  =  u>,  vt,j.  a 


Proof  of  Theorem  7:  Let  x  be  the  unknown  vector  to  be  estimated.  Then  xi‘n  =  E\x  |  ‘JF|J  and  converges 
almost  surely  to  u'  =  E[x  19^,].  Moreover,  o'  minimizes  £[c(u/)[  over  all  ^-measurable  random 
variables  to.  Let  j  £  A(i)  and  let  mk  be  the  time  of  transmission  of  the  fc-th  message  from  j  to  i.  Note 
that  -fa  YltLi  &mk 's  9J,  -measurable  (and  hence  9^-measurable)  and  converges  to  uJ  almost  surely. 
Therefore,  uJ  is  9^-measurablc  and  £[c{u')]<£[c(u■,)]  and,  by  assumption  3,  £[c(u')]  =  £[c(uJ)],  V 
t,  j.  The  minimizing  property  of  u‘  and  the  strict  convexity  of  the  quadratic  cost  function  imply 
that  ti1  ==  u>,  almost  surely. 


Proof  of  Theorem  8:  As  is  usual  in  linear  least  squares  estimation,  we  use  the  setting  of  Hilbert  spaces 
of  square  integrablc  random  variables.  Let  G  be  a  Hilbert  space  of  zero  mean,  jointly  Gaussian  ran¬ 
dom  variables  on  (0, 9,9)  such  that  each  component  of  the  unknown  vector  x  and  the  observations 
belong  to  G.  The  inner  product  in  G  is  defined  by  <  x,  y  >=  E{xy\. 

For  each  agent  t,  let  Hl  denote  the  smallest  closed  subspace  of  G  containing  all  observations 
obtained  by  him.  Let  H'n  be  the  smallest  closed  subspace  of  G  containing  all  observations  obtained 
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by  agent  i  up  to  time  n.  (Note  that  H'n  docs  not  contain  ail  random  variables  known  by  agent  i  at 
time  n,  because  it  does  not  need  to  contain  any  of  the  messages  received  by  agent  s'.)  Note  also  that, 
by  assumption  6,  [I'n  C  ,  C  Hx  and  that  the  total  knowledge  available  to  all  agents 

at  time  n.  The  centralized  estimate  is  the  projection  of  x  on  We  assume,  without  loss  of 

generality,  that  x  is  a  scalar  random  variable,  since  each  component  can  be  separately  estimated. 

Let  xi,  =  /i[j|ciJJ  and  eln  —  x  —  xln  and,  by  the  orthogonality  of  errors  and  observations,  we 
have  £jxy]  =  £[x|,i/],  Vy  €  H'n.  As  in  the  proof  of  Theorem  1  we  have  ||eJ,+,j|2<||c)J|2,  Vn,  i  which 
implies  that 

imi2  >  ii*b-hII2  £  nnii2  m 

In  particular,  (7)  implies  that  {xj,}  is  a  norm-  bounded  sequence.  By  the  weak  local  sequential 
compactness  of  Hilbert  spaces  [15,  p.126],  {£),}  contains  a  weakly  convergent  subsequence  {x^}. 
In  other  words,  there  exists  an  element  x^,  £  G  such  that  <  y,  x\H  >  converges  to  <  y,  £»  >, 
Vy  G  G.  Moreover,  xj,  G  C  an<*  s'nce  closed  subspaces  arc  also  weakly  closed 

[15,  Theorem  11,  p.125],  x ^  G  Hk.  Now  let  y  G  H'n.  Then,  <  y.xj,,  >— <  y,  x  >,  for  all  l 
such  that  rn  >  u,  which  implies  that  <  y,  ioo  >“<  V,  x  >.  Morcocvcr,  the  sequence  of  subspaces 
{//;,}  generates  //*  which  implies  that  <  y.ibo  >=<  y,x  >,  Vy  G  //*. 

By  theorem  1,  (xj, — xf,)  converges  in  the  mean  square  (and  therefore  weakly)  to  zero,  which  implies 
that  ira  is  also  a  weak  limit  point  of  {xj,u}.  The  same  argument  as  before  shows  that  <  y,  i*  >== 
<  !/>  x  >,  Vy  G  Hj,  Vj.  Therefore,  <  y,  >=  <  y,  x  >,  Vy  G  But  this  is  exactly  the 

condition  that  ioo  is  the  centralized  estimate,  given  the  observations  of  all  agents.  So,  {x[J  has  a 
unique  weak  limit  point,  which  is  the  same  for  all  i  and  coincides  with  the  centralized  estimate. 

It  only  remains  to  show  that  x),  converges  to  Xoo  strongly  (in  the  mean  square).  We  know  from 
[15,  p.120]  that  ||xco||<liminf„_00||xjl||.  On  the  other  hand, 

mi2  -  iim2  -  ii*  -  i~ii2<ii*  -  nil2 = iwi2  -  linn2 

which  shows  that  Hx^U  >  lim  sup,,.^  ||£ J,j|.  Therefore,  Hioell  =  lim,,.^,*,  ||xjj|  and  by  theorem  8, 
p.124  of  [15],  we  conclude  that  lim„_0O  || i'n  —  ioo||2  =  0.  | 

Proof  of  Theorem  9:  We  use  again  the  Hilbert  space  formalism  of  the  previous  proof.  Let  GJ,  be  the 
subspace  of  G  describing  the  knowledge  of  agent  t  at  time  n  (both  his  observations  and  the  messages 
he  has  received).  By  assumption  7,  we  have  Gln  C  G)1+I  C  G.  Since  G  can  be  chosen  to  be  finite, 
dimensional,  there  exists  some  M  (depending  on  the  sequence  of  communications  but  deterministic) 
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such  that  G‘v/_f_n  =  G\,,  Vn  >  0,  Vt.  Equivalently,  £‘u+n  =  x\{,  Vn  >  0,  V*,  and  by  theorem  8, 
=  =  ®oo»  V»,  J-  I 

Proof  of  Theorem  10:  Since  u®,  is  ^‘n+1-measurable,  it  follows  that 

*W<+i)l  <E[c«)\  =  mjt  aj<)\  <  E  aiEM<)\  (8) 

i—i  i-*i 

Also,  since  uj,  is  ^measurable,  E[c(u)J]  decreases  and  converges  to  some  g‘.  Taking  the  limit  in 
(8)  we  conclude  that  g'  =  g*,  Vi,j.  From  this  point  on,  the  proofs  of  Theorems  1,  4,  8  (with  minor 
modifications)  are  valid  and  establish  the  desired  conclusions  | 

Proof  of  Theorem  11:  Let  u^,  uj,  be  the  coordinator  messages  and  decisions,  respectively,  when  all 
agents  use  T*.  Let  9j,  be  the  smallest  o-algebra  generated  by  yj,  and  vPn, .  Now  suppose 

that  a  particular  agent,  say  agent  1,  uses  a  strategy  f*  different  from  f\  while  all  other  agents  use 
P.  Let  u°nl  uj,  be  the  coordinator  messages  and  decisions  resulting  from  the  application  of  this  new 
set  of  strategies. 

Wc  proceed  by  induction.  Clearly,  uj  is  ^[-measurable.  Assume  that  uj,  and  u“,  ...,u®_, 
are  eS\ ^measurable.  Then,  because  of  the  linearity  of  the  P's,  u|,  —  uj,,  (t^l),  is  linear  in  u®  — 
&i,  —  u®_,  and  hence  Sf), -measurable.  Since  u°  is  measurable  and  C 

(perfect  memory),  u”  is  9, '^-measurable,  and  so  is  uj,+1.  The  induction  shows  that  this  is  true  for 
all  n. 

Therefore,  by  the  minimizing  property  of  u},,  £[c(ujj]<£(c(uj,)],  Vn  and  this  completes  the 
proof.  B 
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Figure  1:  Mean  Square  errors  of  dis¬ 
tributed  algorithms  with  covariance 
dashed  line  represents  performance 

of  centralized  algorithm. 


^  ERROR 


Figure  2:  Mean  Square  errors  of  dis¬ 
tributed  algorithms  with  covariance 
E  +1:  dashed  line  represents  perfor¬ 
mance  of  centralized  algorithm. 
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